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Abstract. The Full Strategy Minority Game {FSMG) is an instance of the Minority Game 
(MG) which includes a single copy of every potential agent. In this work, we explicitly solve the 
FSMG thanks to certain symmetries of this game. Furthermore, by considering the MG as a 
statistical sample of the FSMG, we compute approximated values of the key variable /N in 
the symmetric phase for different versions of the MG. As another application we prove that our 
results can be easily modified in order to handle certain kind of initial biased strategies scores, 
in particular when the bias is introduced at the agents' level. We also show that the FSMG 
verifies a strict period two dynamics (i.e., period two dynamics satisfied with probability 1) 
giving, to the best of our knowledge, the first example of an instance of the MG for which this 
feature can be analytically proved. Thanks to this property, it is possible to compute in a simple 
way the probability that a general instance of the AIG breaks the period two dynamics for the 
first time in a given simulation. 



1. Introduction 

The Minority Game (MG) was introduced in 1997 by Challet and Zhang [T] in an attempt 
to catch essential characteristics of a competitive population in which an individual achieves 
the best result when she manages to be in the minority group. In the MG, there are agents 
(usually odd), that at each step of the game must choose or 1. Let A^o(*) (resp. Ni(t)) be 
the number of agents choosing (resp. 1) at the step t (note that NQ{t) + Ni{t) = N). The 
winners are those who happen to be in the minority group (i.e., the minimum between NQ{t) 
and Ni{t)). The only information available for the agents is the system state ft G {0,1}™", 
that is updated after each step of the game. The parameter m is an input of the game and 
defines the information-processing capacity of the agents. In the classical version of the MG, 
/i is defined to be an endogenous variable determined from the sequence of minority sides in 
the last m steps, although other kinds of updating rules can be found in the literature [21 [3]. 
In early works, /x was also called history and m the agents' memory. Therefore, the number of 
possible states is H = 2™. Agents play using the so-called strategies. A strategy is a function 
that assigns a prediction (0 or 1) for each of the possible states. In this way, there are 
different strategies. Each agent has s strategies at her disposal (we use s = 2 in this work), 
randomly chosen with replacement from the complete set of strategies at the beginning of the 
game (note that it is possible for an agent to have two identical strategies, and for two agents to 
have the same pair of strategies). At every step of the game, each of the strategies that correctly 
predicted the winning side is awarded a virtual point, regardless of use in that step. At each 
step, each agent plays what her best-performing strategy (in terms of virtual points) predicts. 
If the two strategies have the same number of virtual points, the agent randomly chooses one of 
them. 

Given N agents, an instance of the MG is a particular assignment of strategies to the agents. 
We define a configuration £ of the game to be a pair £ = {^A,I}, where Ai = ...} is a 

sequence of states (generated by any updating rule) and I is an instance of the MG. 
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The observable z = < {Ni — N/2)'^ /N >g is the most studied and instructive variable [3j 
in the MG. It measures the population's waste of resources by averaging on time and over 
different configurations £, the quadratic deviation of the number of agents that chose a fixed 
side (for example A^i) from N/2. The notation cr^/A^ = z \s usual in the literature. One of the 
reasons why the MG has attracted so much attention is that for certain values of the parameters 
m, A^, and s, the variable z is smaller than that obatined for a game in which each of the N 
agents randomly chooses between the two sides. In the latter, Ni(t) corresponds to a binomial 
random variable Bi{N, 1/2), hence z = a'^/N = 1/4. It is interesting to note that z reflects 
that the population as a whole achieves more resources, but does not reveal how that wealth 
is distributed among the agents. Ho et al. adequatedly redefined a Gini index for the MG [5] 
which showed that whenever z takes its minimum value, the inequality among the agents is 
maximized. 

For a = T-L/N = 2™/A^, it was proposed [HE] that the curve given by z against a is indepen- 
dent of N (i.e., z could be regarded as a function of the parameter a) although longer simulations 
of the MG showed that the curve does depend on within the range 0.01 ^ a ^ 0.2, in the 
sense that the variable z increases as N increases [7]. However, such invariance does arise in the 
MGrand introduced by Cavagna [2], where a different updating rule is used: the system state 
is randomly chosen (with uniform distribution) at each game step. In [3], the authors propose 
a periodic updating rule of period which we call MGper throughout this work. Like in the 
MG, z increases with for the MGper (in the curve given by z against a) for the same range 
of a. 

In the region given by a ^ 1, crowd effects arise at some game steps. This behavior is related 
to a dynamics known as Period Two Dynamics (PTD), which was observed for the first time 
by Savit et al. [8]. To understand this dynamics, that plays an important role in the rest of 
the article, for each game step t we define the parity array Vl to be an array of categorical 
variables recording the parity (odd or even) of the number of appearences of each state in the 
first t — 1 game steps. More precisely, we have P| G {O, E}^ and if we identify any state fi with 
the integer number given by the binary expansion of plus one (so that /i can be thought as 
an integer ranging from I to H), we have that V^di) = O (resp. E) if fi has appeared an odd 
(resp. even) number of times in the first t — 1 steps of the game. We assume = {E, E, - ■ ■ ,E), 
as at the beginning of the game any state has appeared zero times, hence an even number of 
times. For a simple example let us consider that {0, 0, 1, 0, 1, 0, 1, • • • } is the sequence of minority 
sides obtained in a game with m = 2. Then if the standard updating rule is used, fl^ = 00, 
fi^ = 01, fl^ = 10, etc., and at time step i = 6, the states //i = 00, /i2 = 01, //s = 10 and /X4 = 11 
have appeared 1,3,2 and times respectively, hence Vj- = {0,0, E,E). We will drop t when 
referring to a generic time step, using Ve{ii) instead. 

The PTD can be summarized in the following way: if at some time step, for some state ^ 
we have Vs{iJi) = O, then in the next (and hence even) appearance of /x, the outcome of the 
game is very likely to be the opposite to that obtained in the previous appearance of Broadly 
speaking, this dynamics is due to the fact that on even appearances of /x crowds of agents will 
move together to the side rewarded in the previous odd appearance. When crowds emerge in the 
game, their contribution to z is very important. Furthermore, crowd effects are the reason why 
z is a large number in this region, showing that fewer resources are allocated to the population 
as a whole. 

In a little more than a decade, there have been many attempts from different backgrounds 
to give a formal framework to the game, and to analytically reproduce the results observed 
in computer simulations [9]. Marsili et al. proposed an analytical approach by resorting to 
sophisticated tools from statistical physics, such as non-equilibrium stochastic dynamics [10] and 
the replica method [TT]; Heimel et al. proposed an analytical approach based on the generating 
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functional analysis [12] . A geometric approach is based on the use of the Reduced Strategy Space 
defined as the subset of independent strategies plus the corresponding totally anticorrelated 
strategies [131 [H]. Based on this approach, Johnson et al. derived approximate expressions for 
the attendance fluctuations [151 US] • [IZj ^ mean field approach to the MG is presented and 
the standard deviation of the key variable A^i — A'^o is computed by introducing a simplified 
framework. The key idea in [T7] consists in defining a particular instance of the game so that, 
for any value of m, all possible strategies and all possible agents (each one represented by a 
possible pair of strategies in the case s = 2) take part in the game. We call this particular 
instance the Full Strategy Minority Game (FSMG). For s = 2, the number of potential agents 
is = (2) + C, where the first term represents all agents with two different strategies, and 
the second term represents the number of agents whose two strategies are identical. Thus, the 
number of agents of the FSMG is a function of m. A/" = J\f{m). Certain symmetries which 
appear only partially in the MG can be fully exploited in the FSMG, and this approach leads 
to interesting theoretical results in the PTD region. 

In this paper, we show that the calculations given in [T7] can be highly simplified and easily 
applied to other variants of the MG, as the above-mentioned M Grand and MGper- By taking 
advantage of its inherent symmetries, we analytically solve the FSMG. In particular, we show 
the role of the number of states in odd occurrences in the calculation of the variable z arriving 
to similar conclusions as that given in [18] by means of a different approach. What is more, our 
calculations also apply to certain kinds of initial biased scores if the bias is introduced at the 
agents' level, i.e. given a positive Uq, any agent decides to assign Uq virtual points "a priori" 
to any of their strategies with probability pi,. On the other hand we define the Strict Period 
Two Dynamics (SPTD) as a PTD with probability 1, and show that SPTD is the characteristic 
dynamics of the FSMG. Moreover, by considering the MG as a statistical sample of size N 
of the FSMG, we are able to predict the validity region of PTD for the A/G in a precise way. 
This theoretical result agrees fairly well with computer simulations. 

2. The Full Strategy Minority Game 

2.1. Symmetry of the FSMG. Let ^ = 2™, £ = 2^ and A" = (f ) + £ be the number of 
different states, different strategies and different potential agents for a MG with parameter m. 
If we take an arbitrary number N of agents it is clear that, in randomly generated instances 
/ of the MG, the following may happen: (1) some potential agent may not participate in the 
game or (2) a multiple copy of the same agent may participate in the game. In the Full Strategy 
Minority Game both cases are excluded: by construction the number of agents in the FSMG 
is set to A" and a single copy of every potential agent is allowed. 5-^ and Sc (also known as 
the Full Strategy Space) denote the set of states and strategies respectively. Throughout this 
article, the symbol (| stands for the cardinality of a set, hence we have ^Sy, = %, and = C 
For a given state ^ G S-y, the subset of strategies in Sc that predict a certain outcome o for the 
state ^ is denoted by Sc,^^d- For an arbitrary outcome o G {0, 1} we will denote the opposite 
side by ~ o. It is clear that Sc,^i~^o U cS£,^^~o = Sc and that ft^^^^^o = tt'^r.^^^s = ^/2. 
This means that for each state ^, the number of strategies predicting o and the number of 
strategies predicting ~ o coincide. This symmetry together with the assumption of the SPTD 
allow us to make a remarkable analytic simplification of the game. Indeed, let us consider an 
arbitrary configuration S of the FSMG, and its state sequence Ai (note that there exists only 
one instance / for each m in the FSMG). Suppose that at step s a certain state /i* = /u occurs 
for the first (and hence odd) time, and call o the winning side after the voting round s. In that 
case the parity array verifies V'^~^^{fi) = O (since V\^) = E if I < s) and strategies belonging to 
the set Sc,^^o are rewarded with a virtual point. Suppose now that at the time step s', s' > s, 
fi occurs for the second time (i.e., jl^ = m)i then the SPTD implies that the winning side after 



THE FULL STRATEGY MINORITY GAME 



4 



the voting round s' will be ~ o, and thus exactly the other half of the strategies {Sc,^^'^o) will 
correctly predict the minority side. Therefore, if we remove the point previously assigned to 
the set of strategies Sc,f_i^o instead of adding a new point to strategies belonging to Scfi^^o, 
the dynamics of the game will remain unchanged. Taking into account this rule, it is clear that 
the number of virtual points accumulated for any strategy, at any time step, ranges from to 
7i. In equation (jl5p we show that the FSMG necessarily verifies the SPTD, and hence the 
SPTD assumption can be removed in the previous argument. As far as we know this is the first 
example in the literature of a game enjoying this property. 

In the sequel we denote with S/^ i the set of strategies with / virtual points, and with S^i^^^q i 
(resp. Sc^ij,^r^o,i) the set of strategies with I virtual points that predict o (resp. ~ o) for fi. 

2.2. Strategies of the FSMG. Given a parity array Vs at an arbitrary time step, we define rie 
(resp. Uo) as the number of even (resp. odd) symbols stored in Vs, we have that < rig, no < "H 
and rio = Ti — rig. Below, we show that the number ^S^^i of strategies with exactly I virtual 
points depends only on Vs, and for a given state /i the values of tt'5£^^_^o ; and ^S^^^^^^^^q^i depend 
only on Vs and the parity of fi. 

li Ue = then no strategies can have virtual points due to the fact that only after odd 
appearances of states the strategies may get a point, i.e., '^Scfi = 'i,Sc = and 'i^Sc,i = for 
1 < i < %. On the other hand, if rig = Ti — 1 then there is only one state, say ^i, in an odd 
occurrence. In this case, strategies can have virtual points due to this state only. But now, 
regardless of the outcome o (i.e., o = or o = 1) after this odd ocurrence of /ii, strategies 
belonging to Sc,fii^o will have exactly one virtual point, and those in the complement S £ni—^r^o 
will have exactly points, hence '^Sc,i = '^Sc,o = >C/2, and '^Sc,i = for 2 < i < %. In the 
general case for parity numbers and Ue we observe that the maximum number of virtual points 
for a given strategy is bounded by Uq, since even ocurrences do not add any points. 

Let < / < no, and let fiki i l^k^o be the states recording odd numbers of ocurrences, and 
01,02,- " 5 be the outcomes after the last ocurrence of each //fc. . The number ]^Sc,i can be 
easily computed by noting that a strategy has exactly I points if it has succesfully predicted the 
outcome (i.e., it has won a virtual point) I times out of the Hq time steps corresponding to the 
last occurrences of the odd states, and has unsuccesfully predicted the outcome in the remaining 
Ho — I time steps. For instance, if a strategy belongs to 

then it has succesfully predicted the outcome at least / times. If in addition it belongs to 

then it has unsuccesfully predicted the outcome at least Hq — I times. So, the strategies belonging 
to 

have exactly I points. We easily find that 

since only C/2 strategies belong to Sc,fi^.^^di and from those strategies only half succesfully 
predict the outcome 02 for the state /i^j, i.e., ^{Sc,fif,_^^di H Sc,fif,^^02) = >C/2^, and so on until 
we obtain the number £/2' of strategies that have succesfully predicted oi, • • • , o^. But from this 
set exactly half fails to predict (i.e., half of them predict ~ Oi+i) and so on. In this way we 
finally arrive to ([T|). Since there are ("°) ways of choosing I states from the set {nki, • • • 1 
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of states on some odd occurrence, we get 

An important remark is the following: for a given state fj, and a given parity array V, the set 
Sc^i can be decomposed in two sets as Sc^i = Sc^ix^o,i U Sc,^-fr^d,i- Whether or not these sets 
have the same cardinality depends on the parity of the state fi. Indeed, if fx is on an even state 
(i.e., V{fi) = E), we have 

^Sc,^^o,i = ^Sc,,-,^s,i = ^Sc,i/2 = (^l'^ £/2-°+i (3) 

since we can apply the same argument used to get ([2]) to the set Sc,fi^o (and Sc,^^^d)- Indeed, 
the set Sc,^-fd has cardinality C/2 and the same symmetry properties than Sc,i (due to the fact 
that V{ijl) = E). The situation is different if we take a state jj. with V{ij) = O, in fact, the 
cardinality of the sets Sc,^^o,i, Sc,^^r^d,i cannot be equal. Indeed, if after the last appearance 
of n the outcome of the game was o, then the strategies belonging to Sc,^^o have at least one 
virtual point, so those strategies with / points have successfully predicted the outcome exactly 
I — 1 times out of the no — 1 time steps corresponding to the remaining states recording an odd 
ocurrence, i.e., 

tt5z:,^-.o,z= (7_"/)/:/2"° (4) 

and those belonging to Sc^fi^r^d.i have successfuly predicted the outcome exactly I times out of 
the 77-0 — 1 time steps corresponding to all the states recording an odd ocurrence, i.e., 

tt5£,M^~o,« = (''°^"^)/:/2"°. (5) 

We again have Sc,i_i^d,i + Sc,fj.~^^d,i = Sc,h which holds since (""-"i^) + ("T^) = ("°)- 

2.3. Agents and dynamics of the FSMG. Let us focus on the M agents of the FSMG. With 
Mo and Nr^o we denote the number of agents choosing the option o and the opposite option ^ o, 
respectively. When needed, we explicitly write A/J and to denote the dependence of these 
variables on the time step t. 

For a given time step t and a parity array we know from the previous section the distri- 
bution of virtual points for strategies ([2]) . Recall that we have n* and n* states on even and odd 
occurrences respectively for a given P| . We drop t from Uq and rig for the sake of simplicity. 
Now we can predict the polls result if agents of the FSMG are required to process a certain state 
IXp (called the present state) given hy an arbitrary updating rule. To this end, we call undecided 
agents to the agents that have both strategies equally rewarded and predict opposite sides for //p, 
i.e., one strategy belongs to Sc,^p-^d,i and the other to Sc,^p^r^b,i , for certain I. The number of 
undecided agents is denoted by Mu- On the other hand, decided agents are the remaining ones, 
so that Ad = M — Mu- We also introduce the number of decided and undecided agents who 
choose the option o to be Md^, and Mug, respectively. Clearly, we have 

Mo = Mds + Mus and M^o = Md^g + Mu^s ■ 

In the following, we will show that Mu, Mdg and Md^g can be explicitly calculated for a given 
state Hp. The value Mu can be obtained by noticing that undecided agents have, for each /, one 
strategy from Sc,fip^o,i and the other one from Sc,fip^^o,i- The total number of agents in this 
situation is 

Ml = ^ ^<Sc,^lp-^^,l^<Sc,^ip'^-^o,l■ (6) 
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The sum ranges over all possible virtual points that strategies may have for the given parity 
array V^. On the other hand, decided agents that choose 5 for the present state have either two 
strategies from Sc,f_ip-^d (first and second terms in equation ([7])) or one strategy from Sc^^p^o,i 
and the other one from Sc,ij.p^r^o,j, with I > j (last term in ([7])): 

Let us first assume that V^ifJ-p) = E, i.e., the present state fip has appeared an even number 
of times up to time step t — 1. Suppose that now agents are required to process the state fip 
(i.e., fip = fi^), so this state is in an odd appearance. Suppose now that o was the outcome of 
the game after the last (and hence even) appearance of fip. From ([3]), ([7]) and ([6]) it is easy to 
check that Mdg = A/^^g, and that 

rio / \ 2 



In short, if agents process a state in an odd appearance, then we have 



Md, - M,^, =0, Mu = £V22-+2 j^2n„^| _ 

Let us assume now that V^^fJ-p) = O (i.e., the present state is in an even appearance). In this 
case, we use (jl]) and ([5]) to obtain Mu, A/rfg, and Md^g. 

Mu = /:V2'- E ("° r ') = ^V2'"° ('^";; , (lo) 

that is obtained by using the Vandermonde's identity. 



C/2 



^fd - 

In short, if agents process a state in an even appearance, then 



Nu = £V2'"°(^^";^ ^^). (13) 
These equations lead to the following remarkable result 

Md, - Md^, -Mu = /:V2'"° ^ °' ^^^^ 

that says that FSMG verifies necessarily the SPTD when the present state is in an even occur- 
rence. Indeed, (|14p was obtained under the assumption that o was the outcome of the game 
after the previous appearance of /ip and shows that 

^d, - Md^, > Mu (15) 
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at the occurrence of the present state. Then, no matter what undecided agents do, the minority 
side win be ~ o after the agents have processed the new appearance of /ip. 

One may think that since SPTD was an assumption, then (jlSp is not surprising. However, we 
can now remove such an assumption. In fact, let us consider a configuration £ of the FSMG, 
and its sequence of states 7W = {jli,jl2, • • • }• Since = {E, E, - ■ ■ ,E), in time step 1 agents 
process /xi, and we have Vg{fJ-i) = E. So we are on an odd appearance of /ii and we easily see 
that our previous arguments lead us to ([9]). The first repeated state will occur in at most H 
steps (the size of the state space), until then all states will occur in an odd number of times 
and virtual points of strategies are bounded by Ti, so we can still apply our arguments leading 
once more to ([9]). Since the first time that a state needs to be processed in an even occurrence 
will take place during the first T-L + 1 steps, then the strategies' virtual points range from to 
7i. Due to this fact, the previous arguments also apply for the first even appearance of a state, 
hence (jl5p holds and enforces the SPTD, allowing us to apply the modified points assignment 
rule for the first time. By repeating the argument for the second, third, etc. even appearances 
of states we can conclude the validity of the SPTD in general. 



3. From the FSMG to the MG 

3.1. Distribution of the agents' choices. An instance / of the MG is a (random) sample 
of size N from the M agents of the FSMG. Consider an experiment consisting in randomly 
extracting (with reposition) a sample of size from a box containing M agents of three different 
types, namely A/^g of type 1, Md^i, of type 2 and Mu of type 3. Suppose that after extracting 
the sample, we obtain N^- agents of type 1, N^^. agents of type 2, and Nu agents of type 3, so 
that 



Nd, + Nd^^ + Nu = N. (16) 

In this setting, (A^,, A^^,, A^) is a random variable with a multinomial probability distribution. 
Therefore, the probability of obtaining A^,, A'^^, and A^ is: 

Pix^ = Nd„x, = Nd^,,xs = ATJ = (^^^), (^^), pf^^P2'^^^^pf. (17) 



where 



Pl = -^, P2 = -^, P3 = -^. (18) 



Our aim is to characterize the variable A5 (i.e., the number of agents who choose side o). 
To this end, consider a second experiment: once a sample with Nd^, ^d^s and A„ agents of 
each type is obtained, we need to know the distribution of A^,, i.e., the number of undecided 
agents choosing o. Since undecided agents randomly pick among their strategies, a reasonable 
assumption is that A^g ~ Bi{Nu, 1/2). From No = Nd- + A^g one can easily show that 

A5~5i(A,pi+p3/2) 

or, using that Pi + P2 + Ps = ^, 

A5~Si(A,l/2 + (pi-p2)/2). (19) 



Equation (|19p will be useful in the following sections. 
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3.2. Analytic calculations for different updating rules. For a fixed side o, the expected 
value 

fj2 = < {No - N/2f > (20) 

over every possible configuration S- measures the waste of the population's resources. Our 
previous calculations allow us to compute the expected values on odd = < (iVg — N/2)'^ 
and even Ug = < {No — iV/2)^ >e occurrences of the present state //p, in order to find a closed 
expression for (pOj) . 

Given a random variable y ~ Bi{N,p) it is known that Var{y) = Np{l —p), and < y >= Np, 
hence from (1191) we have that 



Var{No) = iV/4(l - {pi - p2f) = iV/4 



< No >= N/2{1 + {pi - p2)) = N/2 (l + 




where the last identity follows from (fTH]) . By ([9]) and (fT3]) . we have that Md^ — A/^^g only 
depends on no for both odd and even appearances of /ip. Therefore, if for a given updating rule 
the probability distribution of the random variable no is known, then (I20p can be computed. 
Clearly, we have 

^2 = Var{No - N/2) + {< No > -N/2f = Var{No) + {< No > -N/2f. (22) 
By replacing ([2T]) in ([22]) we get 

= N/A (^ + {N- 1) . (23) 

We can use this equation on odd (cTq) and even (cjg) occurrences. In odd occurrences of the 
present state jip equation ([9]) yields A/^g — A/^^g = and hence ([231) gives 

al = iV/4 (24) 

regardless the updating rule, as observed when the variable (y^/N is computed in numerical 
simulations. On even occurrences of -when the crowd effect arises- from ([23p and ([13p we 
obtain 



a'=N/A\l + {N-l) 




for a parity array with no states in odd ocurrences. Using that ^ = , £ = 2^ and 

r2no-i\ _ r2no\i2 gg^j^ write 

-e,n. = fl + .."""L f 1/2^- ) ) 1 . (25) 




;i + 2-«)2 

The role of the updating rule becomes relevant in the "typical" numbers no that appear when 
agents are processing a state /ip in an even occurrence. To be more precise, if p{no\lJip = e) 
stands for the probability of finding no states in odd occurrence given that fip is on an even 
occurrence then we have 

al = Y^p{no\^lp = e)(jl„^^. (26) 

rio 

We apply the obtained results to different kinds of updating rules. Our two first examples 
are based on the exogenous updating rules found in MGrand [2] and MGper [3]. In the end 
of the subsection we address the standard MG. Since the FSMG verifies the SPTD and our 
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calculations rely on the FSMG, we expect to find good results only in the region of validity of 
the PTD. 

In the M Grand J the present state fj,p is chosen at random (uniformly) from the whole set of 
states 5-^. In this case one can easily get 

Pinoll^p ^ e)MG..„, = 1/2^-' ~_ . (27) 

Indeed, if is in an even appearance, then its parity is recorded as odd in "P, i.e., V{fJ.p) = O, 
hence from the % — 1 remaining states /i we need to find the probability that exactly rio — 1 of 
them verify 'P(^) = O. Since each state is in odd occurrence with probability 1/2, (|27p follows. 
Now, we are ready to find an approximate value of (y\,iQ ^ = l/2a1 + l/2a1. From (j2^ . (p5|) . 
pG]) . and we obtain 

4,c_/A'^i/4+5(^^^^E (v2-(l":))'(r-l)- <^*' 

77-0 — 1 

It is not clear whether or not (j28p depends only on a = 2"^ /N = T-L/N. However, it can be 
approximated by ([29]) . which only depends on a (i.e., it follows the scaling relation cf./fj /A^ ~ 
/(a), [^). Indeed, due to the fact that in the M Grand approximately half of states will be in odd 
occurrences, we can only take one "typical" value = l-L/2 m. (|25p . neglecting the contribution 
of any other crg„^ to in (126p . By doing this we obtain the following approximation: 

<a..JN ~ 1/4 + {^/,)) - m- 

Taking into account that > 1, 2"^ = 2^'" > 1 (so that N-lr^N,l + ^r^l), and using the 
well known approximation (which can be obtained straightforwardly from Stirling's formula), 

win/2) ~ \/i we get 

^MG.UN-y^+^- (29) 

In Figure [1] we show the analytical result given by (I28p . the approximated expression ()29p . and 
the numerical results for the M Grand with = 4001. 



B ' r 




0.0001 0.001 0.01 0.1 1 

a 

Figure 1. The straight line shows /N as a function of a for the MGrand for 
different values of m (from 2 to 14) and = 4001. For each value of A'^ and m, 
100 runs have been performed, each one of T = 100000 time steps discarding the 
first 50000 steps. Empty circles show the analytical result ()28p . and star-shaped 
symbols show the approximated invariant (i.e., depending only on q) expression 
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Our next aim is to apply our calculations to the MGper introduced in [3]. In this case the 
updating rule follows a periodic pattern of period % that runs over all the states. In fact, with 
our convention that identifies any state ^ with its binary expansion plus one, the updating rule 
proposed in [3] follows the natural order 1 — )■ 2 — )• 3 • • • modulo %. In this case we have again 
'^i/Gper ~ ^/'^^l + l/2Ce- On the other hand we clearly get 

p{no\^lp = e)MGj,er = 

and then, from ^ together with ([25]), and ([26]) we find 



(30) 



2 



JN = 1/A + 



{N-l) 



87^(1 + 2-«)2 



E 



1 f2no 

22"° V Tin 



(31) 



In Figure [2] we show the agreement between (|3ip and numerical experiments for the MGper with 
N = 4001. In much the same way as for the MG^and^ this expression can be highly simplified. 

Indeed, taking into account that ^ 1, 2^ = 22"" ^ 1, and using again that Trk-i"^^"^ ~ \ ^— 
, we get 

that can be simplified once more 



N 



<iG,.rl^ ~ 1/4 + ^(log^ + 7), 



(32) 



mi: 

by using that limA/->oo Si<i<A/ 1/^ ~ log(M) — )• 7, where 7 = 0.57 ... is the constant of Euler- 
Mascheroni. Equation ([32]) is essentially the same obtained in [18] by means of a different 
approach. 




Figure 2. On the left, a'^/N as a function of a for the MGper for different values 
of m (in the range from 2 to 14) and =: + symbols for N = 501; x N = 1001; 
^ N = 2001; A N = 4001. For each value of N and m we perform 100 runs, 
each one of T = 100000 time steps discarding the first 50000 steps. On the right, 
the full line is the MGper with N = 4001, empty circles are given by ([3T]) . and 
M show the approximated expression ([32]) . 



Finally, we turn our attention to the standard AdG. As before, the key tool is equation (I26p . 
for which we need to know p(no|/Up = e)MG- We obtain this probability as follows. The updating 
rule in the standard MG consists in defining the present state as the last m outcomes of the 
game, in this way any chain of correlative states given by the updating rule can be seen as a 
walk on the De Bruijn diagram corresponding to each m |19j (Figure [3] shows the diagram of 
order 2). 

Note that each node in the De Bruijn diagram has two incoming links and two outgoing links. 
We performed numerical random walks obeying the SPTD on the De Bruijn diagram, i.e., in 
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Figure 3. De Bruijn diagram of order 2. All 4 possible fi states are represented: 
00, 01, 10 and 11. Links represent the possible sequences among these states 
under the rules of the MG. For example, after the state 00 there are two possible 
states which can follow: 01 and 00, and there are two possible preceding states: 
10 and 00. 



odd occurrences of each state we choose at random one from the two available states allowed by 
the diagram, and on even occurrences we choose the state opposite to the one selected in the 
previous odd appearance. We recorded at each time the value of Uq and after a simulation of 
50 different walks of 500000 steps, we obtained the approximated probability distribution of the 
variable rio, that can be seen in Figure [H 




Figure 4. Histograms of probability distribution p{no\fJ-p = e)MG (line) and 
pi^olfip = e)MGrand (dash) with parameters m = 2, m = 3, m = 5 and m = 8. 
These results were obtained by simulating the possible sequence of states under 
two assumptions: the sequence of states satisfies the rule of the MG (i.e., is a 
walk in the De Bruijn diagram), and SPTD is valid in the even occurrence of each 
state. The MGrand is displayed for illustrative purposes, since it is analytically 
given by (|27l) . 



.2 

'MG 



Using the obtained p{no\fip = e)MG, we computed 
Figure El shows the analytical and numerical results. 

The fact that the histograms given in Figure [4] agree for m 



by means of ([MI), ^ and (l26]) . 



This is consistent with numerical simulations in which crf^jQ 



2 is particularly interesting. 
.,r for m = 2 and = 4001 
(see Figure [6]). For m = 3, the histograms corresponding to MG and MGrand slightly differ but 
still are very similar, and a statistical test cannot reject the null hypothesis HO: "the values 
obtained from MG and MGrand are mutually consistent (i.e., correspond to variables with the 
same mean value)". 
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Figure 5. Left: jN as a function of a in the MG for different values of m 
(in the range from 2 to 14) and A^. The dash hue shows the case iV = 501; x 
N = 1001; the full line shows N = 2001; k N = 4001. For each value of and 
m we performed 100 runs, each one of T = 100000 time steps discarding the first 
50000 steps. Right: Numerical MG (the full line), and analytical result (o) for 
N = 4001. 

3.3. Biased Initial Scores. Our previous calculations were developed on the assumption that 
the strategies have zero virtual points at the beginning of the game. As another application of 
the FSMG, we consider the case in which an initial bias is introduced to the strategies scores. 
The first possibility could be the following (see Ref [21]): take a given Uq > and randomly 
assign Uq (instead of zero) virtual points to certain strategies belonging to the whole set of 
strategies of the FSMG. Following this idea, one is tempted to compute a variant of ([2]) taking 
into account such a bias. Regretfully, one can readily discover that the original symmetry of 
the FSMG has been broken, and it is not clear how to obtain an analogous of ([2]). However, a 
second possibility can be managed. Suppose that the bias is introduced at the agents' level, i.e. 
any agent randomly chooses, with a bias probability pb, to assign "a priori" Uq virtual points 
to each one of their strategies. This way of assigning initial scores differs from that studied in 
[9|I121[T0]. where a uniform initial bias (called i/q or q{0) in those works) is applied to each agent. 
It is important to notice that, in our approach at most a fraction 2pb(l — pb) (in average) of 
agents have different initial scores (in fact a difference of Uq points) between their strategies. 

For the sake of simplicity, we fix the bias probability pb = 1/2 although we explain below how 
to handle more general values (in fact, rational values of pb)- Let us introduce an "enlarged" set 
of strategies consisting in two copies 5^, of the original set of strategies Sc of the FSMG. In 
S}-, all the strategies have virtual points and Uq in 5?. Now, the extended FSMG^ consists in 
a single copy of any possible agent obtained from the enlarged set 5^'^ = ^^U^^. Following our 
previous ideas, the described biased MG can be thought as a statistical sample of the FSMG^ . 
The number of agents of the FSMG is now = (^2^) + 2C. The important fact in this 
scenario is that the symmetry still holds inside each 5^, and thanks to this one can replicate 
our previous arguments obtaining the analogous of ([2]) 

tJ^'j = ^Sl, + ^Sli = /:/2"° + (^^ J £/2-° . (33) 

where I ranges now from to Ti + Uo (notice that the combinatorial number (^) is zero if 6 > a 
or 6 < 0). Therefore, analogous for @ and ^ can be computed and hence the same thing can 
be done with ([7]). By doing this, one can get for the enlarged sets, 

\ \no-l J \no-l + UoJ \no - 1 - UoJ J 

V V / \no-Uo J \ no + Uo J J 
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Figure 6. The main figure shows (y'^-and/^ ~ (circles), a^^^/N = X2 (trian- 
gles) and a\jQ/N = X3 (diamonds) for games with = 4001 and different values 
of m in the range from 2 to 14. We simulated 100 independent runs of 100000 
time steps each. We average the mean value of jN using the last 50000 steps 
for each run. The figure inset reports the same values corresponding to m = 2 
with error bars (Axi, Ax2 and Axs, respectively). The bars overlap for this value 
of ra. We performed a statistical test on means in order to check the null hy- 
pothesis HO: "the three values are mutually consistent within these error bars, 
i.e., the obtained values of xi, X2 and 2:3 correspond to measurements of variables 
with the same mean value, and normal distribution" [20]. We computed the sta- 
tistical value S as the weighted sum of the squared deviations from the weighted 
average value of the three measurements (x), i.e., S = Yl^=i i^i ~ x)'^//S.xf where 
Xi and Axi are the variable and error corresponding to the i-th model and x is 
the maximum likelihood estimator of the mean of the three values. For m = 2, 
we obtained S = 1.30 and for this reason HO cannot be rejected, even when 
p{no\fJ.p = ejMGper clearly differs from p{no\np = e)MG and p{no\tJ-p = e)A^G™„d- 
If we perform the test only for the couple MG and M Grande we cannot reject 
HO (in this case HO: "the two values are mutually consistent") for m = 3, but 
we can reject it for m = 4, . . . , 9 with significance level smaller than 0.5%. For 
m = 3 and the three models we obtain S = 20.54 and, in this case, we reject HO 
with less than 0.5% of significance level, concluding that results from MGper are 
different from MG and MGj-and in games with m = 3 with a 0.5% probability of 
error of the second kind. 



when agents are processing a state in an even appearance. Furthermore, from these expressions, 
one can easily check that the SPTD is still verified in this new game. Therefore, proceeding 
as before we can use (j23p in order to get analytical approximations for a'^/N. In Figure [7] we 
compare our analytical calculations with numerical simulations of the MGj-and in which any 
agent chooses to assign, with probability 1/2, Uq virtual points (instead of zero) with Mq = 2 and 
tto = 4 to their strategies. The calculations were carried out by using ([27]) . as the conditional 

probabilities p(no|//p = e). Comparing with the unbiased 1 it is easy to prove 

that the bias implies a mitigation of the crowd effect, a fact that becomes apparent in Figure [71 



In order to find expressions for similar games with p = a/b {a rational number with a, b integer 
and positive numbers with b > a) then one needs to use an extended set of strategies in the 
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a 



Figure 7. Left: a'^/N function of a. Numerical simulations of the unbiased 
M Grand (fuH line), and biased M Grand with Uq = 2 (dashed line). Our analyt- 
ical calculation is displayed in full circles. Right: the same as left with = 4. 
In all the cases, = 1001, and for each value of and m we performed 100 
runs, each one of T = 100000 time steps discarding the first 50000 steps. 

following way: take b — a copies of the set Sc and keep the strategies in zero virtual points, then 
take a copies of the set Sc and assign Uo virtual points to their strategies. Consider the enlarged 
FSMG^ by taking a single copy of any possible agent with strategies taken from copies of Sc- 
Now, the pfe-biased MG can be understood as a sample of this game, and the calculations can 
be repeated step by step. Moreover, different Uq can be introduced in the same game by adding 
more copies of Sc- 

Since the influence of initial biased scores of the type described in this section does not seem 
to be too well documented in the existing literature, we illustrate the behaviour of the biased 
game by means of several numerical simulations given in Section 21 

3.4. Range of validity of PTD. If o was the outcome in the last odd occurrence of some state 
then for the next (hence even) appearance of ;U, equation (jlSp shows that the FSMG verifies 
the SPTD. Since the MG can be regarded as a statistical sample of the FSMG, this brings a 
natural way to approximate the probability of breaking the PTD for the first time in the MG- 
Indeed, if we denote with PpTD the probability of verifying the PTD for some configuration £ 
of the MG we clearly have 



On the other hand, from (jl9|) we see that PpTD can be easily calculated. In fact, by using the 
normal distribution approximation to the binomial (jl9p we get 



where $ stands for the cumulative function of the standard normal distribution. Probabilities 
Pi can now be calculated by means of ([T8|) and using ([13]). Since p3|) depends on rio, the values 
of p{no\iJip = e) are needed in order to find pi — p2- For the MG these values were obtained in 
Subsection 13 . 21 bv means of the de Bruijin graph, but since we do not have an analytic expression 
for them, we may use the values (j27p for the MGrand o,s an approximation. Moreover, a further 
approximation can be made, as it was mentioned before in the MGrand approximately half of 
states will be in odd occurrences, and therefore we can take Uq = T-L/2 as a "typical" value (in 
fact the inset of Figure [8] shows the analytical results for both approaches and there are not any 
noticeable differences) . 



PpTD = P{No > N/2). 




(35) 
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Taking a rough approximation by using only the typical Uq = T-L/2 we arrive to the compact 
expression 




It is important to note that our arguments are developed under the assumption of SPTD. Once 
the period two dynamics is broken, the system falls into states whose values of pi and p2 can 
not be calculated by means of this approach. This means that Eq. (|36|) is a good approximation 
for the probability of breaking PTD for the first time in a given realization of the MG. Despite 
the involved approximations, and as it is shown in Figure [HI theoretical results are in very good 
agreement with numerically computed values of PpTD for the MG. 




Figure 8. Probability Pptd for PTD to take place in even occurrences of the 
states as a function of a, calculated as one minus the probability of breaking the 
PTD for the first time in a given simulation of the game. The full line is the 
analytic case (by using the approximation Uq = "^72). Empty circles: results of 
the simulation of the MG. In the simulations we have computed Pptd as follows: 
at each even occurrence of each state, if after the poll the PTD is not fulfilled (i.e., 
the minority side agrees with that obtained after the previous -odd- occurrence 
of the same state), then we consider that this step does not contribute to Pptd- 
At the same time, present state and virtual points are assigned as if the PTD 
had not failed. This way we compute the probability of breaking the PTD for 
the first time in the game. The empty circles show the average of 100 runs of 
100000 steps each, for N = 501, 531, 561, 591, 621, 651, 681, 711, 741, 771 and 
m = 2, . . . , 7. Inset: comparison between the expression (I36p (the full line) and 
the analogous obtained by using the complete distribution of p{no\^p = e) given 
by (f27l) (full circles). Both curves are indistinguishable. 

4. Numerical simulations for the biased MG 

In this section we present numerical results addressing the behaviour of the biased MG studied 
in Subsection lS.Sl We first plot jN vs. a for different values of initial scores, always for p = 1/2 
as the probability of assigning the initial scores to the strategies. The main plot in Figure [9] 



THE FULL STRATEGY MINORITY GAME 



16 



shows these results for Uq = 2, 6, 10. Initial scores decrease the waste of resources only during the 
crowd dynamics. In the inset of the same figure, we present detailed results for Uq = 4, 6, 8, 10, 
zooming on the range at which these curves change their curvature. Let us mention that, from 
equation ([3^ . it is easy to see that for values of Uq greater than Uq — I, the expression for 
- A/"^, becomes independent of Uo (indeed we get A/"^ " -^^g = /2'^"-°2[^^°zl))- In 
particular, taking the "typical" Uq = 71/2 for the MGrandi we see that if ^ 'H/2 — 1, then a 
curve of vs. a for a given value of Uq, overlaps with any other curve given by a greater 

values of Uo in the region of m < 1 + log2(^io + 1) (see Figure 9, e.g., if = 6 then the condition 
is met for m < 3.8). 

Figure [To] a) and Figure [TOl b) show a'^/N and the probability PpTD-, respectively, vs. the 
initial scores for several values of a. In all cases, N = 1001 is considered. Let us notice that 
PTD is valid in the biased model for m = 2 (a ~ 0.004) in all the range of initial biased values. 
If m = 5 (a ~ 0.032) then PTD is broken for = 6, if m = 6 (a ~ 0.064) then PTD is broken 
for Uo = 3, and if m = 7 (a ~ 0.128) or m = 8 (a ~ 0.256) then PTD is broken for all values of 
biased scores starting from = 1, as in the case of unbiased MG (although for m = 7, PpTD 
is very close to 1 when Uq = 1). 

It is worth to remark that since the maximum of 2pb(l — pi^) is reached for Pb = 1/2 (in fact, 
this is the case in our simulations) , we see that at most half of the agents (in average) are allowed 
to have different initial point scores for their strategies. Remarkably, as it is easily deduced from 
our calculations the reduced variance a'^/N behaves as a~^, as Uo — ?• oo. To be more precise, 
for each a > it is enough to take Uo > Na = Ti in order to obtain an analogous of equation 
(1291) . that reads: 

1 N 

in contrast with the case of uniform bias [3 [12], for which a'^ /N ~ a when — )■ oo. 




Figure 9. a'^/N as a function of a. Numerical simulations of the unbiased MG 
(dash line), and biased MG with Uq = 2 (filled circles), Uo = 6 (filled squares), 
and Uq = 10 (filled triangle). The inset shows results for values of Uq = 4, 6, 8, 10, 
using empty squares for Uo = 4, empty triangles for Uo = 8, and the same symbols 
as in the main figure for the other cases. Curves in the inset are zooming in 
the range at which these curves change their curvature. Let us mention that 
curves overlap for values of m < 1 + log2(uo + !)> as can be predicted from our 
calculations. In all the simulations, = 1001. We performed 50 runs, each one 
of T = 100000 time steps discarding the first 50000 steps for cases Uq = 0, 2, 4, 6, 
and T = 50000 time steps discarding the first 10000 steps for the cases Uo = 8, 10. 
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initial bias initial bias 



Figure 10. Left: a) a'^/N vs. initial scores at a fixed value of a (we take 
= 1001, and m = 2 • • • 8), with filled circles (m = 2), diamonds (m = 3), empty 
squares (m = 4), X symbols (m = 5), filled squares (m = 6), filled triangles 
(m = 7) and empty triangles (m = 8). Right: b) PpTD in even occurrences of 
the states as a function of initial scores at a fixed value of q. In the simulations 
we have computed PpTD as we did in Figure [8] for the cases N = 1001 and 
m = 2,5,6,7 and 8. Symbols are the same as in figure a). In both figures, we 
performed 50 runs, each one of T = 100000 time steps discarding the first 50000 
steps for cases Uq = 0, 2, 4, 6, and T = 50000 time steps discarding the first 10000 
steps for the cases Uq = 1, 3, 5, 7, 8. 

5. Conclusions 

In this paper we showed that the FSMG can be useful to understand certain features of the 
MG in the symmetric phase. The FSMG is a maximal instance of the MG, where a single 
copy of every potential agent takes part of the game. Due to this fact, several symmetries can 
be exploited, allowing us to obtain analytical solutions for the FSMG. These theoretical results 
were used to compute approximated values of the key variable a'^/N for the standard MG, as 
well as for other versions based on different updating rules that can be found in the literature. 
It is also shown that our technique allows to handle certain cases of strategies with biased initial 
scores. We were able to show that the FSMG enjoys the strict period two dynamics, a fact 
that led us to a simple way of computing the probability of breaking the period two dynamics 
for the first time in a given realization of the MG. We are convinced that the FSMG and the 
framework presented can be useful, in the symmetric phase, for other variants of the MG. 
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